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SUMMARY 


The  goal  of  this  project  is  to  isolate  the  gene  for  human 
serum  cholinesterase  (EC  3. 1.1. 8).  We  have  isolated  a  cDNA  clone 
from  a  human  brain  basal  ganglia  cDNA  library,  that  codes  for  a 
portion  of  the  cholinesterase  enzyme.  The  clone  contains  1440  bp, 
of  which  1138  bp  are  in  the  coding  region,  and  302  bp  are  in  the 
3'  untranslated  region.  The  coding  region  corresponds  perfectly 
to  the  amino  acid  sequence  of  human  serum  cholinesterase  starting 
at  phenylalanine  195  to  leucine  574  at  the  carboxy  terminus.  This 
confirms  that  our  cDNA  clone  codes  for  cholinesterase  rather  than 
for  acetylcholinesterase  (EC  3. 1.1. 7).  Furthermore  it  leads  to  the 
conclusion  that  the  amino  acid  sequence  of  cholinesterase  from  two 
different  tissues,  from  human  serum  and  human  brain,  is 
identical. 

The  strategy  for  finding  the  cDNA  clone  depended  on  our 
knowledge  of  the  amino  acid  sequence  of  human  serum 
cholinesterase.  Our  laboratory  had  determined  the  complete  amino 
acid  sequence,  and  had  designed  12  oligonucleotide  probes 
corresponding  to  regions  containing  minimum  codon  ambiguity. 
These  oligonucleotide  probes  were  successfully  used  to  find  the 
cDNA  clone  for  cholinesterase.  We  are  continuing  to  search  the 
brain  cDNA  library  for  other  cDNA  clones  that  will  extend  our 
present  clone  in  the  5'direction  so  as  to  have  the  complete  coding 
region  of  cholinesterase. 
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INTRODUCTION 


The  goal  of  this  project  is  to  isolate  the  gene  for  human 
serum  cholinesterase.  Our  strategy  for  finding  the  gene  was  based 
on  our  previous  results,  where  we  had  determined  the  complete 
amino  acid  sequence  of  human  serum  cholinesterase.  Knowledge  of 
the  amino  acid  sequence  allowed  us  to  design  oligonucleotide 
probes  that  would  selectively  hybridize  with  the  gene  for  human 
serum  cholinesterase.  We  have  used  these  probes  to  screen  a  human 
brain  cDNA  library  and  have  found  one  positive  clone.  The  DNA 
sequence  of  the  positive  clone  corresponded  to  a  portion  of  the 
amino  acid  sequence  of  human  serum  cholinesterase,  starting  at 
phenylalanine  195  and  extending  past  leucine  574  at  the  carboxyl 
terminus,  into  the  3'  untranslated  region. 

Isolation  of  the  clone  for  human  serum  cholinesterase  will 
allow  large  scale  production  of  this  enzyme  via  recombinant  DNA 
technology.  This  is  of  military  importance  because  cholinesterase 
has  the  property  of  being  highly  reactive  with  organophosphate 
esters  of  the  type  used  in  nerve  gas.  Cholinesterase  could  be 
used  as  a  protective  agent,  to  protect  personnel  against  the  toxic 
effects  of  nerve  gas.  It  could  also  be  used  to  clean  up  spills  in 
situations  where  a  chemical  agent  such  as  sodium  hydroxide  would 
be  too  corrosive  for  practical 1  purposes . 


METHODS 
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01 igonucleotide  probes .  Twelve  oligonucleotide  probes  were 
made.  The  probes  are  listed  in  Table  1.  The  probes  were  based  on 
the  amino  acid  sequence  of  human  serum  cholinesterase  (1)  . 
Regions  of  the  amino  acid  sequence  representing  minimum  codon 
ambiguity  were  chosen.  The  location  of  each  probe  within  the 
complete  amino  acid  sequence  is  illustrated  in  Table  2.  Probes  6 
and  7  were  synthesized  by  the  University  of  Michigan  DNA 
synthesizing  facility.  All  other  probes  were  synthesized  by 
PL-Biochemical  Corp.,  Milwaukee,  WI .  Probes  1,  3,  and  4  represent 
the  coding  strand  of  the  amino  acid  sequence.  All  other  probes 
represent  the  reverse  complement. 

Three  different  principles  were  used  in  designing  these 
probes.  Probes  1-5  are  mixed  oligonucleotides  containing  all 
possible  DNA  sequences  corresponding  to  the  selected  amino  acids. 
Probes  6  and  7  are  unique  sequence  probes;  in  every  position  where 
more  than  one  nucleotide  could  have  been  used  for  a  codon,  a 
single  choice  was  made.  Unique  sequence  probes  are  longer  than 
mixed  probes  to  compensate  for  the  likelihood  of  a  mismatch. 
Probes  8-12  contain  inosine  at  every  position  where  more  than  one 
nucleotide  could  be  chosen. 

The  following  variables  were  considered  in  designing  our 
unique  sequence  probes  (2) .  The  amino  acid  sequence  of 
cholinesterase  was  examined  to  find  the  longest  region 
corresponding  to  the  least  codon  ambiguity.  Two  such  regions  were 
found.  The  region  EWGNNAFFY YFEH  corresponded  to  39  nucleotides 
having  an  83.3%  probability  of  being  homologous  to  the  DNA 
sequence.  This  probability  was  calculated  from  the  observations 
that  all  13  amino  acids  in  this  region  had*  fixed  bases  in 
positions  1  and  2  of  their  codons,  and  that  the  3rd  base  was  fixed 
for  1  amino  acid  (W) ,  but  could  be  1  of  2  choices  for  5  amino 
aicds  (E,N,F,Y,H) ,  and  could  be  1  of  4  choices  for  2  amino  acids 
(G,A) .  A  second  region  was  WPEWMGVMHGYEIEFVFG  corresponding  to  53 
nucleotides  having  an  85.0%  probability  of  being  homologous  to  the 
DNA  sequence.  Codon  degeneracy  was  limited  to  the  3rd  base  in 
each  codon;  4  amino  acids  had  no  degenerate  codons,  7  amino  acids 
had  a  choice  of  2  codons,  and  6  amino  acids  had  a  choice  of  4 
codons.  These  two  regions  were  located  within  6  amino  acid 
residues  of  each  other.  This  proximity  increased  the  chance  that 
a  clone  would  be  represented  by  both  regions,  and  therefore  that  a 
positive  recombinant  clone  might  be  positive  with  both  probes. 
This  is  important  for  eliminating  false  positives. 

To  further  increase  the  likelihood  of  homology,  codon  usage 
was  considered.  Codon  usage  in  human  blood  coagulation  factor 
VIII  (3)  was  analyzed  to  determine  which  codons  were  used  most 
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frequently.  Human  blood  coagulation  factor  VIII  was  selected  as  a 
model  because,  like  cholinesterase,  it  is  a  glycoprotein  that  is 
synthesized  and  secreted  by  the  liver.  Furthermore,  it  is  present 
in  low  concentration  in  plasma,  suggesting  that  its  mRNA  belongs 
to  the  category  of  "weakly  expressed"  mRNA  (4) .  Grantham  et  al 
(4)  found  a  correlation  between  codon  usage  and  whether  an  mRNA 
was  highly  or  weakly  expressed.  Codon  usage  in  factor  VIII  showed 
a  preference  for  Ala  (GCT) ,  Asp  (GAT) ,  Glu  (GAA) ,  Gly  (GGA) ,  Lys 
(AAA) ,  and  Val  (GTC) .  The  other  amino  acids  had  no  clear  codon 
choices.  G  was  infrequently  used  in  the  3rd  positions  of  Pro, 
Ser,  Thr,  and  Ala.  Codons  for  those  amino  acids  having  no  clear 
preference  in  codon  usage  were  selected  by  following  the  rule  that 
a  G:T  mismatch,  though  less  stable  than  a  G:C  pair  still 
contributes  to  hybridization  stability  whereas  an  A:C  mismatch 
does  not  (2) .  Thus  G  was  chosen  for  the  3rd  position  when  the 
choice  was  G  or  A,  and  T  was  chosen  for  the  3rd  position  when  the 
choice  was  T  or  C.  Following  this  rule,  we  chose  T  for  the  3rd 
positions  of  Asn,  His,  Phe ,  Tyr,  and  lie.  We  also  chose  T  for  the 
3rd  position  of  Pro,  because  G  was  infrequently  used. 

The  use  of  deoxyinosine  at  positions  corresponding  to 
ambiguous  nucleotides  was  introduced  by  Ohtsuka  et  al  (5).  They 
synthesized  23mer  and  26mer  probes,  each  containing  five 
deoxyinosines ,  and  successfully  used  these  to  find  a  genomic 
clone.  Deoxyinosine  does  not  contribute  to  the  stability  of  a 
nucleotide  pair,  but  neither  does  it  destabilize.  For  these 
reasons  we  designed  a  series  of  inosine  probes.  These  are  probes 
#8,9,10,11,  and  12  in  Table  1. 

.  32 

Probes  were  labeled  at  the  5'  position  with  gamma  P-ATP  and 
T4  polynucleotide  kinase. 

cDNA  library.  A  cDNA  library  from  the  basal*  ganglia  of  a 
1-day  old  human  brain  was  from  Dr.  Robert  A.  Lazzarini  (6).  This 
library  can  now  be  purchased  from  the  American  Type  Culture 
Collection,  Rockville,  MD.  The  cDNA  was  in  vector  lambda  gtll  to 
which  it  was  linked  via  Eco  Rl  linkers.  The  library  had  first 
been  screened  by  Dr.  Charles  McTiernan  in  the  laboratory  of  Dr. 
Terrone  Rosenberry  at  Case  Western  Reserve  University  Medical 
School.  Dr.  McTiernan  had  screened  the  library  .th  our  53mer 
unique  sequence  probe  and  had  found  several  positive  clones.  Upon 
sequencing  one  of  these  clones,  he  found  that  it  me  '  ed  our  amino 
acid  sequence  for  human  serum  cholinesterase,  which  uad  not  yet 
been  published.  Because  of  this  collaboration,  we  knew  that  this 
particular  library  contained  clones  for  human  cholinesterase. 

Screening  the  cDNA  library.  The  in  situ  plaque  hybridization 
method  of  Benton  &  Davis  (V)  as  modified"  by  Maniatis  et  al  (8)  was 
used  to  screen  150,000  plaques  spread  on  20  plates,  containing  47 
ug/ml  ampicillin.  The  host  was  E.  col i  strain  Y1088  (9).  Four 


lifts  were  made  from  each  150  mm  plate  onto  four  separate 
nitrocellulose  filters  (Millipore,  HATF  137  50,  pore  size  0.45 
urn).  Each  filter  was  hybridized  with  a  different  probe  or  set  of 
probes.  Pre-hybridization  was  in  0.25%  Carnation  instant  dry 


milk,  6  x  SSC  (6 
at  60  °  for  2  to 
Carnation  instant 
for  12  to  16  h. 
specific  for  each 


x  SSC  is  0.9  M  NaCl ,  0.09  M  NaCitrate,  pH  7.0), 
16  h  (10).  Hybridization  was  in  50  ml  of  0.25% 
dry  milk,  6  x  SSC  containing  120  uCi  of  probe. 
The  hybridization  and  washing  temperatures  were 


filters  were  washed  at  room  temperature  in  6  x  SSC,  0.05% 
pyrophosphate,  and  then  at  46°  for  10  min  in  2  x  SSC,  0.1%  SDS. 
The  39mer  probe  was  hybridized  at  36°;  filters  were  washed  at  room 
temperature  in  6  x  SSC,  0.05%  pyrophosphate  and  then  at  46°  for  10 
min  in  6  x  SSC,  0.05%  pyrophosphate.  Probes  #8,  #10,  and  #12  in 
Table  1,  were  used  together.  They  were  hybridized  at  36°;  filters 
were  washed  at  room  temperature  in  6  x  SSC,  0.05%  pyrophosphate, 
followed  by  10  min  at  43°  in  the  same  buffer.  Probes  #4,  #9,  and 
#11  were  used  together.  They  were  hybridized  at  36°;  filters  were 
washed  at  room  temperature  in  6  x  SSC,  0.05%  pyrophosphate, 
followed  by  10  min  at  41°  in  the  same  buffer. 


probe. 


53mer  probe  was  hybridized  at  46 


pyrophosphate , 


Purification  of  DNA .  This  procedure  is  described  in  detail 
because  it  is  unpublished.  It  was  adapted  from  David  Law's  thesis 
(11) .  The  advantage  of  this  method  is  that  it  avoids  a  long 
centrifugation  in  cesium  chloride. 

PREPARATION  OF  HOST  CELLS.  A  single  colony  of  Y1088  was 
inoculated  into  50  ml  of  NZYM  medium  supplemented  with  0.25  ml  of 
1%  thymidine.  NZYM  medium  was  made  by  dissolving  10  g  casein 
enzymatic  hydrolysate  from  Sigma,  5  g  NaCl,  5  g  bacto-yeast 
extract  (Difco  Labs),  2  g  MgS0..7H_0  in  water  to  a  total  volume  of 
1  liter  and  adjusting  the  pH^to  7.5  with  NaOH.  Bacteria  were 
grown  for  11  h  at  37°C. 

ADSORBTION  OF  PHAGE  TO  HOST  CELLS.  To  1  ml  of  the  fresh 
overnight  culture  of  Y1088,  were  added  5  ml  of  phage  containing 
5.6  x  10b  pfu,  1  ml  of  50  mM  calcium  chloride,  and  15  ul  of  1  M 

gfllV.  if  S!ini¥?8i&?!$  £lSeti8feifgnfffcaiilfHjlatgft!)umgef31?£ 

phage  were  used,  the  yield  of  DNA  decreased  drastically.  .  The 
phage  were  allowed  to  adsorb  to  the  bacteria  for  10  min  at  37°. 

PHAGE  GROWTH.  The  phage-Y1088  mixture  was  added  to  1  liter  of 
NZYM  medium  supplemented  with  5  ml  of  1%  thymidine.  The  medium 
was  in  a  2  liter  Fernbach  flask.  It  was  shaken  at  200  rpm  on  a 
shaking  table  at  37  for  18  h.  After  18  h  the  appearance  of  the 
medium  was  cloudy  with  white  swirls  and  small  white  pieces  of 
debris.  58  g  NaCl  and  20  ml  chloroform  were  added  to  further  lyse 
the  bacteria.  The  medium  was  shaken  an  additional  15  min.  It  was 
centrifuged  at  8000  rpm  for  10  min  at  4°C  to  remove  cell  debris. 
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PEG  PRECIPITATION.  To  the  clear  supernatant  was  added  100  g 
of  polyethylene  glycol  (PEG  8000  from  Sigma)  and  12  ml  of  1  M 
MgSO..  This  was  slowly  stirred  at  4°  for  1  h,  and  then 
centrifuged  for  10  min  at  4°  at  8000  rpm.  Two  centrifuge  cups 
were  used  and  pellets  were  stacked  on  top  of  each  other. 

CHLOROFORM  EXTRACTION  OF  PEG  PELLET.  Pellets  were  suspended 
in  SM.  SM  was  prepared  by  adding  5.8  g  NaCl,  2  g  MgS0..7H  0,  50  ml 
of  1  M  TrisCl  pH  7.5,  and  5  ml  of  2%  gelatin  to  a  total  volume  of 
1  liter  in  water  and  autoclaving.  5  ml  of  SM  was  added  to  each 
centrifuge  bottle  and  the  suspension  was  gently  homogenized  in  a 
glass  homogenizer.  The  pooled  suspension  was  poured  into  one  30 
ml  corex  tube.  10  ml  of  chloroform  was  added,  and  the  liquids 
mixed  by  drawing  into  a  pasteur  pipette.  This  was  centrifuged  at 
8000  rpm  at  4  for  15  min.  *  The  chloroform  layer  went  to  the 
bottom,  and  the  aqueous  layer  to  the  top  of  the  tube.  The  aqueous 
layer  was  transferred  into  4  thick  walled  SW50.1  centrifuge  tubes, 
and  centrifuged  in  an  SW50.1  swinging  bucket  rotor  (Beckman  Co.) 
at  20,000  rpm  for  2  h  at  4°.  A  small  yellowish  precipitate 
contained  the  DNA. 

PROTEINASE  K  AND  RNAse  DIGESTION.  The  supernatant  was 
discarded.  150  ul  of  SM  was  added  to  each  tube  and  the  pellet 
suspended  by  shaking  at  room  temperature  for  1  h.  The  total 
volume  was  adjusted  to  2  ml  .with  SM.  EDTA  was  added  to  a  final 
concentration  of  20  mM;  SDS  to  0.5%;  proteinase  K  (Sigma)  to  50 
ug/ml:  and  RNAse  to  20  ug/ml .  The  mixture  was  incubated  1  to  12  h 
at  65°. 

PHENOL/CHLOROFORM  EXTRACTION.  The  solution  was  extracted  with 
1  volume  of  phenol,  followed  by  1  volume  of  phenol/chloroform 
(1:1),  and  finally  1  volume  of  chloroform.  The  DNA  stayed  in  the 
aqueous  layer. 

ETHANOL  PRECIPITATION.  The  aqueous  layer  was  divided  into  3 
microcentrifuge  tubes  (1.5  ml  capacity)  so  that  each  tube 
contained  about  700  ul.  Added  70  ul  of  3  M  NaAcetate  pH  4.6  and 
700ul  cold  isopropanol.  Huge  pellets  came  down  after  a  3  min 
centrifugation  in  the  microfuge.  The  pellet  was  washed  with  1  ml 
of  70%  cold  ethanol.  The  DNA  pellet  was  air  dried  for  3  to  5h. 

ESTIMATION  OF  DNA  YIELD.  The  pellet  was  incubated  overnight 
in  400  ul  of  20  mM  TrisCl  pH  7.5,  0.1  mM  EDTA  at  room 
temperature.  The  resulting  solution  was  viscous.  Absorbance  of  a 
1:100  diluted  sample  was  measured  at  260  nm  and  280  nm.  The  DNA 
concentration  was  calculated  from  absorbance  at  260  nm,  where  an 
OD  of  1  corresponded  to  50  ug/ml  for  double  stranded  DNA.  The 
260/280  absorbance  ratio  of  pure  DNA  was  1.8.  Contamination  by  RNA 
gave  a  ratio  higher  than  1.8.  Contamination  by  protein  or  phenol 
gave  a  ratio  lower  than  1.8.  The  yield  was  2  to  5  mg  of  DNA. 
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Restriction  mapping .  The  purified  DNA,  still  attached  to 
lambda  gtll  arms,  was  digested  with  restriction  enzymes  purchased 
from  Boehringer  Mannheim,  BRL,  or  Amersham.  The  digested  DNA  was 
electrophoresed  on  0.8%  agarose  (BRL,  ultrapure) ,  stained  with 
ethidium  bromide,  photographed,  and  transferred  to  nitrocellulose 
by  the  method  of  Southern ( 12 ) .  The  DNA  bands  containing  the  human 
cDNA  were  identified  by  hybridizing  with  various  probes  from  the 
list  in  Table  1. 

DNA  sequencing.  The  2.4  kb  fragment  produced  by  digestion  of 
the  lambda  gtll  clone  with  Kpn  1  and  Eco  Rl  was  transfected  into 
Ml3mpl9.  The  2.4  kb  clone  was  partially  sequenced  by  the  dideoxy 
chain  termination  method  of  Sanger  (13).  Subclones  of  the  2.4  kb 
fragment  were  prepared  by  isolating  the  Kpn  1-Eco  Rl  fragment  from 
agarose  gel  by  electroelution,  and  then  digesting  with  Sau  3A1. 
The  Sau  3A1  digest  was  ligated  into  the  isoschizomer ic  Bam  Hi  site 
of  Ml3mpl8.  These  subfragments  were  prepared  because  the  2.4  kb 
clone  was  too  long  to  sequence  to  completion  when  it  was  intact. 
60  clear  plaques  were  selected  at  random  for  preparation  of  the 
single  stranded  recombinant  M13  to  be  used  for  DNA  sequencing. 
All  60  subclones  were  sequenced  by  the  dideoxy  method  of  Sanger. 

Computer .  We  used  the  Nucleic  Acid  Query  Program  of  the 
Protein  Identification  Resource  ,  National  Biomedical  Research 
Foundation,  Georgetown  University  Medical  Center,  3900  Reservoir 
Road,  N.W.,  Washington,  D.C.  20007  phone  202  625-2121.  This 
program  allowed  us  to  store  and  align  our  DNA  sequence  files,  and 
to  compare  the  DNA  sequence  of  our  clone  to  those  in  the  computer 
databank . 


1 


8mix 


17mer  WPEWMG 


430-435 


5 1 TGQCCNCAATGGATGGG  3' 
G 


2. 

8mix 

17mer 

WPEWMG 

430-435  Reverse  complement 

5 '  CX3CATOCATTGNGGCX2A 
C 

3. 

16mix 

17mer 

WKNQFN 

557-562  5 ' TGGAAAAAOCAATTCAA 

G  T  G  T 

4. 

32mix 

17mer 

KEFQEG 

348-353  5 ' AAAGAATTCCAAGAAGG 

G  G  T  G  G 

5. 

4mix 

17mer 

MMDWKN 

554-559  Reverse  complement 

5 '  TTTTTOCAGTOCATCAT 
C  A 

6. 

39mer 

39mer 

EWGNNAFETYFEH  1  411-423  Reverse  complement 

5 ' ATGTTCAAAATAATAAAAAAAAQCATTATTTOCCCATTC 

7. 

53mer 

53mer  WPEWMGVMHGYEIEFVFG  430-447  Reverse  complement 

5 '  CCAAAGACAAATTCAATTTCATATCCATGCATGACTCXXIATCXIIATTCAGGGCA 

8. 

AEW 

29mer 

AEWEWKAGFH 

539-548  Reverse  complement 

5 ' TGIAAICCIGCITTCCAITCCCAITCIGC 

9. 

WNN 

29mer 

WNNYmDWKN 

550-559  Reverse  complement 

5 '  TTITTOCAITCCATCATITAITTITTOCA 

10. 

GQF 

23mer 

GQFKKTQI 

310-317  Reverse  complement 

5 ' ATITGIGTITTITTIAAITGIOC 

11. 

GVN 

29mer 

GVNKDEGTAF 

320-329  Reverse  complement 

5 '  AAIGCIGTICCITCITCITTITTIACIC3C 

12. 

VYG 

20mer 

VYGAPGF 

331-337  Reverse  caiplement 

5 ' AAICCIGGIQCIOCITAIAC 

3' 


3* 

3' 


3' 


3' 


3' 


3' 


3' 

3' 


3' 


3' 


Single  letter  code  for  amino  acids:  A=alanine,  C=cysteine,  D=aspartic 
acid,  E=glutamic  acid,  F=phenylalanine,  G=glycine,  H=histidine,  1= 
isoleucine,  K=lysine,  L=leucine,  M=methionine,  N=asparagine,  P= 
proline,  Q=glutamine,  R=arginine,  S=serine,  T=threonine,  V=valine, 
W-tryptophan ,  Y=tyrosine.  Probe  sequence:  N=A,C,G  or  T. 


Table  2.  Location  of  oligonucleotide  probes  in  the 
amino  acid  sequence  of  human  serum  cholinesterase. 


50 

EDDI I IATKNGKVRGMNLTVFGGTVTAFLGI PYAQPPLGRLRFKKPQSLTKWSDIWNATK 
.  .  .  100 

YANSCCQNIDQSFPGFHGSEMWNPNTDLSEDCLYLNVWIPAPKPKNATVLIWI YGGGFQT 
.  .  150 

GTSSLHVYDGKFLARVERVIVVSMNYRVGALGFLALPGNPEAPGNMGLFDQQLALQWVQK 
.  200  .... 
NIAAFGGNPKSVTLFGESAGAASVSLHLLSPGSHSLFTRAILQSGSFNAPWAVTSLYEAR 
250  ...  300 

NRTLNLAKLTGCSRENETEI IKCLRNKDPQEILLNEAFVVPYGTPLSVNFGPTVDGDFLT 
.  .  .  .  350 

DMPDILLELGQFKKTQILVGVNKDEGTAFLVYGAPGFSKDNNSI ITRKEFQEGLKIFFPG 

#10  #11  #12  #4 

.  .  .  400  .  « 

VSEFGKES ILFHYTDWVDDQRPENYREALGDWGDYNFICPALEFTKKFSEWGNNAFFYY 

V 

#6 

.  .  450  ... 

FEHRSSKLPWPEWMGVMHGYEIEFVFGLPLERRDNYTKAEEILSRSIVKRWANFAKYGNP 

- #  1 ,  #  2 

- - #7 

.  500  .... 

NETQNNSTSWPVFKSTEQKYLTLNTESTRIMTKLRAQQCRFWTSFFPKVLEMTGNIDEAE 

550  .  .  574 

WEWKAGFHRWNNYMMDWKNQFNDYTSKKESCVGL 

- #9 

#8  - #5 

- #3 


Refer  to  Table  1  for  the  probe  sequence  corresponding  to  each 
probe  #.  A  dot  is  above  every  tenth  amino  acid. 


RESULTS 


One  positive  clone  was  found  out  of  150,000  screened.  The 
positive  clone  hybridized  with  at  least  4  different  probes.  It 
hybridized  with  the  53mer  probe  (#7  in  Table  1) ,  the  39mer  probe 
(#6  in  Table  1),  with  one  or  more  probes  in  the  set  containing  #8, 
#10,  and  #12  ,  and  with  one  or  more  probes  in  the  set  containing 
#4,  #11,  and  #9.  The  positive  clone  was  called  Z3.  Clone  Z3  was 
plaque  purified  by  replating  it  two  more  times.  On  the  second 
replating,  4%  of  the  plaques  hybridizd  with  the  53mer  probe,  and 
on  the  third  replating  100%  of  the  plaques  hybridized. 

Clone  Z3  was  cultured  in  1  liter  of  Y1088  bacteria,  and  the 
phage  DNA  purified.  To  determine  the  size  of  the  human  DNA 
insert,  clone  Z3  was  digested  with  Kpn  1,  Sst  1,  Eco  Rl  and  with 
combinations  of  these  enzymes.  The  digested  DNA  was 
electrophoresed  on  0.8%  agarose,  transferred  to  nitrocellulose, 
and  hybridized  with  the  53mer  probe.  Fig.  1  shows  that  digestion 
with  Eco  Rl  alone  did  not  yield  the  human  DNA  insert.  Eco  Rl  had 
been  expected  to  cut  out  the  human  cDNA  because  the  cDNA  had  been 
ligated  to  the  vector  via  Eco  Rl  linkers.  This  would  have  given  a 
minimum  of  3  fragments:  the  cDNA  fragment,  the  19.6  kb  left  arm 
and  the  24.1  kb  right  arm  of  lambda  gtll.  The  two  large  fragments 
obtained  after  digestion  with  Eco  Rl  suggested  that  Eco  Rl  cut  at 
only  one  end  of  the  cDNA.  The  digestion  with  Kpn  1  and  Eco  Rl 
yielded  a  2.4  kb  fragment  of  which  1  kb  belonged  to  lambda  phage 
and  1.4  kb  was  human.  The  Kpn  1  and  Sst  1  digest  yielded  a  3.4  kb 
fragment  which  contained  2  kb  of  lambda  phage  DNA  and  1.4  kb  of 
human  DNA.  The  digestion  with  Eco  Rl  and  Sst  1  verified  the 
result  already  apparent  from  other  digestions  regarding  the 
orientation  of  the  Eco  Rl  site.  The  functional  Eco  Rl  site  was  1 
kb  away  from  the  Sst  1  site,  while  the  faulty  Eco  -Rl  site  was  1  kb 
away  from  the  Kpn  1  site.  Another  conclusion  from  Fig.  1  was 
that  the  1.4  kb  cDNA  contained  no  internal  Eco  Rl,  Kpn  1  or  Sst  1 
sites. 

Clone  Z3  was  ligated  into  M13mpl9  for  DNA  sequencing.  Two 
types  of  primers  were  used:  the  15  b  primer  complementary  to  M13, 
and  a  17  b  oligonucleotide,  probe  #3  in  Table  1,  complementary  to 
the  cDNA.  DNA  sequencing  gave  approximately  450  nucleotides  of 
sequence.  To  complete  the  sequence,  clone  Z3  was  subfragmented 
with  Sau3Al  and  ligated  into  the  isoschizomer ic  Bam  HI  site  of 
M13mpl8.  For  DNA  sequencing  60  clear  plaques  were  selected  at 
random.  These  60  subclones  contained  11  fragments  of 
cholinesterase.  Fig.  2  shows  the  location  of  the  6  Sau  3A1 
restriction  sites  in  clone  Z3.  The  functional  Eco  Rl  site  was  at 
nucleotide  580,  and  the  faulty  Eco  Rl  site  was  at  nucleotide  2025. 
The  faulty  Eco  Rl  site  had  a  single  base  change.  It  had  the 
sequence  AAATTC  instead  of  the  correct  GAATTC.  Despite  the  faulty 


Eco  R1  site,  the  junction  between  human  and  lambda  phage  DNA  was 
recognized  by  the  lac  Z  sequence  which  is  part  of  lambda  gt  11. 

Fig.  2  shows  that  most  of  the  clone  was  sequenced  in  both 
directions.  The  Sau  3A1  subclones  provided  the  greatest  part  of 
the  sequence.  The  uncut  Z3  clone  extended  the  sequence  at  the  5' 
end  and  overlapped  the  3'  coding  strand  with  the  3'  untranslated 
region.  The  alignment  of  the  Sau  3A1  subclones  was  determined  by 
comparing  their  DNA  sequences  to  the  DNA  sequence  of  human  serum 
cholinesterase  predicted  from  its  amino  acid  sequence.  The 
computer  was  used  for  this  purpose. 

Table  3  shows  the  nucleotide  sequence  of  clone  Z3  and  the 
corresponding  amino  acid  sequence.  The  amino  acid  sequence 
predicted  by  clone  Z3  is  a  perfect  match  to  the  amino  acid 
sequence  of  human  serum  cholinesterase  which  we  had  determined  by 
Edman  degradation  (1) . 

Clone  Z3  contains  1440  nucleotides.  These  correspond  to  the 
coding  sequence  of  cholinesterase  from  phenylalanine  195  through 
leucine  574  at  the  carboxyl  terminus,  and  extend  302  bases  into 
the  3'  untranslated  region.  The  stop  codon  immediately  after 
leucine  574  is  TAA. 


Clone  Z3  digested  with  Kpn  I,  Set  I,  and  Eco  RI 
and  hybridized  with  the  53mer  probe. 


K  S  E  Not 
cut 


KS  ES  KE  KSE 


-46  kb 
"21 


Fig.l.  Southern  blot  of  restriction  digested  clone  Z3.  Clone 
Z3  is  in  lambda  gtll.  Clone  Z3  was  digested  with  Kpn  1,  Sst  1, 
Eco  Rl  and  with  combinations  of  these  enzymes.  The  digests  were 
electrophoresed  on  0.8%  agarose,  transferred  to  nitrocellulose, 
and  hybridized  with  the  53mer  probe. 
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Phc  195 


1138  bp  coding  for  Cholinesterase 


,  902  bp 
3  uriimWid 


Fig. 2.  Hestr ict ion  map  of  clone  Z  3  showing  sequencing 
strategy .  Clone  Z3  contains  1440  bases.  1138  bases  are  Tn  the 
coding  region  and  these  code  for  phenylalanine  195  through  leucine 
574  of  human  cholinesterase.  302  bases  are  in  the  3'  untranslated 
region.  The  5'  end  of  clone  Z3  was  defined  by  an  Eco  Rl  site. 
The  3'  end  had  a  single  base  substitution  in  the  Eco  Rl  site,  and 
therefore  was  not  cleaved  by  Eco  Rl.  The  location  of  six  Sau  3A1 
sites  is  indicated  by  S.  Sau  3A1  subclones  were  sequenced  in  both 
directions.  The  uncut  Z3  clone  was  sequenced  using  two  different 
primers:  a  15  b  primer  for  M13,  and  a  17  b  primer  (probe  #3  in 
Table  1)  that  corresponded  to  cholinesterase  amino  acids  557  to 
562. 


Table  3. 
sequence . 


DNA  sequence  of  clone  Z3  and  derived  amino  acid 


Phe 

Gly 

Glu 

Ser 

Ala 

200 

Gly 

Ala 

Ala 

Ser 

Val 

Ser 

Leu 

His 

Leu 

Leu 

Ser 

TTC 

GGA 

GAA 

AGT 

GCA 

GGA 

GCA 

GCT 

TCA 

GTT 

AGC 

CTG 

CAT 

TTG 

CTT 

TCT 

Pro 

Gly 

Ser 

His 

Ser 

Leu 

Phe 

Thr 

Arg 

Ala 

lie 

Leu 

Gin 

Ser 

Gly 

Ser 

CCT 

GGA 

AGC 

CAT 

TCA 

TTG 

TTC 

ACC 

AGA 

GCC 

ATT 

CTG 

CAA 

AGT 

GGA 

TCC 

Phe 

Asn 

Ala 

Pro 

Trp 

Ala 

Val 

Thr 

Ser 

Leu 

Tyr 

Glu 

Ala 

Arg 

Asn 

Arg 

TTT 

AAT 

GCT 

CCT 

TGG 

GCG 

GTA 

ACA 

TCT 

CTT 

TAT 

GAA 

GCT 

AGG 

AAC 

AGA 

Thr 

Leu 

Asn 

Leu 

Ala 

Lys 

Leu 

Thr 

Gly 

Cys 

Ser 

Arg 

Glu 

Asn 

Glu 

Thr 

ACG 

TTG 

AAC 

TTA 

GCT 

AAA 

TTG 

ACT 

GGT 

TGC 

TCT 

AGA 

GAG 

AAT 

GAG 

ACT 

Glu 

He 

lie 

Lys 

Cys 

Leu 

Arg 

Asn 

Lys 

Asp 

Pro 

Gin 

Glu 

lie 

Leu 

Leu 

GAA 

ATA 

ATC 

AAG 

TGT 

CTT 

AGA 

AAT 

AAA 

GAT 

CCC 

CAA 

GAA 

ATT 

CTT 

CTG 

Asn 

Glu 

Ala 

Phe 

Val 

Val 

Pro 

Tyr 

Gly 

Thr 

Pro 

Leu 

Ser 

Val 

Asn 

Phe 

AAT 

GAA 

GCA 

TTT 

GTT 

GTC 

CCC 

TAT 

GGG 

ACT 

CCT 

TTG 

TCA 

GTA 

AAC 

TTT 

Gly 

Pro 

Thr 

Val 

Asp 

Gly 

Asp 

Phe 

Leu 

300 

Thr 

Asp 

Met 

Pro 

Asp 

lie 

Leu 

GGT 

CCG 

ACC 

GTG 

GAT 

GGT 

GAT 

TTT 

CTC 

ACT 

GAC 

ATG 

CCA 

GAC 

ATA 

TTA 

Leu 

Glu 

Leu 

Gly 

Gin 

Phe 

Lys 

Lys 

Thr 

Gin 

lie 

Leu 

Val 

Gly 

Val 

Asn 

CTT 

GAA 

CTT 

GGA 

CAA 

TTT 

AAA 

AAA 

ACC 

CAG 

ATT 

TTG 

GTG 

GGT 

GTT 

AAT 

Lys 

Asp 

Glu 

Gly 

Thr 

Ala 

Phe 

Leu 

Val 

Tyr 

Gly 

Ala 

Pro 

Gly 

Phe 

Ser 

AAA 

GAT 

GAA 

GGG 

ACA 

GCT 

TTT 

TTA 

GTC 

TAT 

GGT 

GCT 

CCT 

GGC 

TTC 

AGC 

Lys 

Asp 

Asn 

Asn 

Ser 

lie 

lie 

Thr 

Arg 

Lys 

Glu 

Phe 

Gin 

Glu 

Gly 

Leu 

AAA 

GAT 

AAC 

AAT 

AGT 

ATC 

ATA 

ACT 

AGA 

AAA 

GAA 

TTT 

CAG 

GAA 

GGT 

TTA 

Lys 

lie 

Phe 

Phe 

Pro 

Gly 

Val 

Ser 

Glu 

Phe 

Gly 

Lys 

Glu 

Ser 

lie 

Leu 

AAA 

ATA 

TTT 

TTT 

CCA 

GGA 

GTG 

AGT 

GAG 

TTT 

GGA 

AAG 

GAA 

TCC 

ATC 

CTT 

Phe 

His 

Tyr 

Thr 

Asp 

Trp 

Val 

Asp 

Asp 

Gin 

Arg 

Pro 

Glu 

Asn 

Tyr 

Arg 

TTT 

CAT 

TAC 

ACA 

GAA 

TGG 

GTA 

GAT 

GAT 

CAG 

AGA 

CCT 

GAA 

AAC 

TAC 

CGT 

Glu 

Ala 

Leu 

Gly 

Asp 

Val 

Val 

Gly 

Asp 

Tyr 

Asn 

Phe 

lie 

400 

Cys 

Pro 

Ala 

GAG 

GCC 

TTG 

GGT 

GAT 

GTT 

GTT 

GGG 

GAT 

TAT 

AAT 

TTC 

ATA 

TGC 

CCT 

GCC 

Leu 

Glu 

Phe 

Thr 

Lys 

Lys 

Phe 

Ser 

Glu 

Trp 

Gly 

Asn 

Asn 

Ala 

Phe 

Phe 

TTG 

GAG 

TTC 

ACC 

AAG 

AAG 

TTC 

TCA 

GAA 

TGG 

GGA 

AAT 

AAT 

GCC 

TTT 

TTC 

Tyr 

Tyr 

Phe 

Glu 

His 

Arg 

Ser 

Ser 

Lys 

Leu 

Pro 

Trp 

Pro 

Glu 

Trp 

Met 

TAC 

TAT 

TTT 

GAA 

CAC 

CGA 

TCC 

TCC 

AAA 

CTT 

CCG 

TGG 

CCA 

GAA 

TGG 

ATG 

2«! 


Table  3.  continued 


Gly  Val  Met  His  Gly  Tyr  Glu  lie  Glu  Phe  Val  Phe  Gly  Leu  Pro  Leu 
GGA  GTG  ATG  CAT  GGC  TAT  GAA  ATT  GAA  TTT  GTC  TTT  GGT  TTA  CCT  CTG 


Glu  Arg  Arg  Asp  Asn  Tyr  Thr  Lys  Ala  Glu  Glu  lie  Leu  Ser  Arg  Ser 
GAA  AGA  AGA  GAT  AAT  TAC  ACA  AAA  GCC  GAG  GAA  ATT  TTG  AGT  AGA  TCC 


lie  Val  Lys  Arg  Trp  Ala  Asn  Phe  Ala  Lys  Tyr  Gly  Asn  Pro  Asn  Glu 
ATA  GTG  AAA  CGT  TGG  GCA  AAT  TTT  GCA  AAA  TAT  GGG  AAT  CCA  AAT  GAG 


Thr  Gin  Asn  Asn  Ser  Thr  Ser  Trp  Pro  Val  Phe  Lys  Ser  Thr  Glu  Gin 

ACT  CAG  AAC  AAT  AGC  ACA  AGC  TGG  CCT  GTC  TTC  AAA  AGC  ACT  GAA  CAA 

500 

Lys  Tyr  Leu  Thr  Leu  Asn  Thr  Glu  Ser  Thr  Arg  lie  Met  Thr  Lys  Leu 

AAA  TAT  CTA  ACC  TTG  AAT  ACA  GAG  TCA  ACA  AGA  ATA  ATG  ACG  AAA  CTA 


Arg  Ala  Gin  Gin  Cys  Arg  Phe  Trp  Thr  Ser  Phe  Phe  Pro  Lys  Val  Leu 
CGT  GCT  CAA  CAA  TGT  CGA  TTC  TGG  ACA  TCA  TTT  TTT  CCA  AAA  GTC  TTG 


Glu  Met  Thr  Gly  Asn  lie  Asp  Glu  Ala  Glu  Trp  Glu  Trp  Lys  Ala  Gly 
GAA  ATG  ACA  GGA  AAT  ATT  GAT  GAA  GCA  GAA  TGG  GAG  TGG  AAA  GCA  GGA 


His  Arg  Trp  Asn 
CAT  CGC  TGG  AAC 


Asn  Tyr  Met  Met  Asp 
AAT  TAC  ATG  ATG  GAC 


Cys  Val 
TGT  GTG 
ctt  tag 
aaa  agt 
ttc  etc 
atg  get 
taa  tta 
tta  ata 


Trp  Lys 
TGG  AAA 
574 
Gly  Leu 
GGT  CTC 
ate  aag 
tat  tat 
aca  tet 
ag a  aca 
tgt  gca 
aat  tta 


Asn  Gin  Phe  Asn 
AAT  CAA  TTT  AAC 


taa  tta 
gca  aaa 
gta  get 
tta  act 
tgt  tta 
tat  taa 
agt  ttt 


Amino  acids  are  numbered  according  to  their  location  in  the 
complete  sequence  of  human  serum  cholinesterase. 
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DISCUSSION 


We  have  succeeded  in  isolating  a  cDNA  clone  for  human 
cholinesterase.  Our  strategy  was  based  on  our  knowledge  of  the 
complete  amino  acid  sequence  of  human  serum  cholinesterase.  The 
amino  acid  sequence  was  determined  in  our  laboratory  and  is  not 
yet  published,  though  it  is  in  press  (1).  The  amino  acid  sequence 
allowed  us  to  prepare  oligonucleotide  probes  to  use  for  screening 
gene  libraries.  Our  most  successful  probe  was  the  unique  sequence 
53mer  probe.  It  could  be  hybridized  at  a  fairly  high  stringency 
of  46°C.  The  inosine  probes,  #8-#12  in  Table  1,  were  the  least 
useful  because  they  hybridized  at  lower  temperatures  than  the 
mixed  oligonucleotide  probes,  #l-#5  in  Table  1.  Our  12  probes  are 
clustered  near  the  carboxy  terminus  portion  of  cholinesterase  (see 
Table  2) .  This  was  partly  by  chance  because  the  first  probes 
synthesized  (#1 , 2 , 3 , 4 , 6 , 7)  had  unknown  locations  within  the 
complete  sequence.  Probes  #5,8,9,10,11,12  were  designed  to  be 
near  the  carboxy  terminus  because  we  expected  this  region  to  be 
better  represented  in  a  cDNA  library. 

We  screened  4  different  human  cDNA  libraries:  a  liver,  muscle, 
lymphocyte  cell  line  GM2131A,  and  a  brain  library.  We  found  a 
cholinesterase  clone  only  ink  the  brain  basal  ganglia  library, 
despite  the  fact  that  each  of  the  other  tissues  has  been  reported 
to  synthesize  cholinesterase  (14,15,16).  It  is  possible  that  the 
other  cDNA  libraries  also  contained  cholinesterase  and  that  we 
missed  the  cholinesterase  clone  by  chance. 

Clone  Z3  contained  1440  nucleotides.  The  amino  acid  sequence 
of  this  clone  corresponded  perfectly  to  amino  acids  195-574  of 
human  serum  cholinesterase.  It  was  especially  reassuring  to  find 
asparagines  in  positions  241,  256,  341,  455,  481,  '  and  486  because 
our  amino  acid  sequencing  paper  had  reported  that  these 
asparagines  were  glycosylated.  Glycosylated  asparagines  do  not 
show  up  during  amino  acid  sequencing  and  therefore  identification 
was  by  indirect  arguments.  Our  cloning  results  support  the 
conclusion  that  these  are  glycosylated  asparagines. 

The  perfect  match  of  the  amino  acid  sequence  predicted  from 
clone  Z3  with  the  amino  acid  sequence  of  human  serum 
cholinesterase  leads  to  the  conclusion  that  human  brain 
cholinesterase  is  identical  to  human  serum  cholinesterase,  at 
least  in  the  region  of  amino  acids  195  to  574.  This  is  the  first 
evidence  that  cholinesterases  from  two  different  tissues  are 
identical . 

The  only  published  nucleotide  sequence  of  a  cholinesterase  is 
the  sequence  of  acetylcholinesterase  from  the  electric  organ  of 
Torpedo  cal i f ornica  (17) .  No  human  acetylcholinesterase  sequence 


is  available  for  comparison,  nor  any  mammalian 
acetylcholinesterase.  Acetylcholinesterase  is  considered  to  be 
distinct  from  cholinesterase,  though  the  structural  basis  for  the 
observed  activity  differences  is  unknown.  Comparison  of  the  amino 
acid  sequences  of  acetylcholinesterase  from  Torpedo  and 
cholinesterase  from  human  serum  shows  an  identity  of  53.8%. 
Comparison  of  the  nucleotide  sequences  shows  an  identity  of 
approximately  55%.  These  are  strikingly  high  degrees  of  identity 
when  it  is  considered  that  the  species  are  far  apart  in  evolution, 
and  that  the  two  proteins  are  thought  to  be  products  of  different 
genes.  One  might  speculate  that  human  acetylcholinesterase  will 
be  even  more  similar  to  human  serum  cholinesterase  than  Torpedo 
acetylcholinesterase,  so  that  the  identity  may  turn  out  to  be  as 
high  as  70  to  80%. 

Comparison  of  the  53mer  probe  region  in  Table  4  shows  that  the 
18  amino  acids  in  this  probe  are  identical  in  Torpedo 
acetylcholinesterase  and  human  cholinesterase  except  for  the 
substitution  of  one  isoleucine  for  methionine,  which  is  a  single 
base  change.  However,  the  Torpedo  nucleotide  sequence  differs 
from  the  human  sequence  in  10  nucleotides.  The  53mer  probe  turned 
out  to  be  94.3%  correct  for  human  cholinesterase  but  only  81% 
correct  for  Torpedo  acetylcholinesterase.  This  suggests  that 
frequency  of  codon  usage  in  a  particular  species  is  an  important 
consideration  in  designing  a  probe.  Furthermore,  it  suggests  that 
the  amino  acid  sequence  is  more  conserved  than  the  nucleotide 
sequence. 


FUTURE  PLANS 

We  plan  to  use  a  portion  of  clone  Z3  as  a  probe  to  rescreen 
the  library.  Approximately  120  bp  from  the  5'  end  of  clone  Z3 
will  be  used  to  find  the  overlapping  cDNA  clone.  The  purpose  will 
be  to  find  the  rest  of  the  coding  region  for  cholinesterase  and  to 
find  the  5'  untranslated  region. 


Table  4.  Amino  acid  and  nucleotide  sequences  of  the  53mer  probe, 
human  serum  cholinesterase  and  Torpedo  acetylcholinesterase  in  the 
region  corresponding  to  the  53mer  probe. 


Amino  acid  sequence 
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Nucleotide  sequence 


♦mismatch 


53mer  CCAAAGACAAATTCAATTTCATATCCATGCATGACTCCCATCCATTCAGGCCA 

*  *  * 

CCAAAGACAAATTCAATTTCATAGCCATGCATCACTCCCATCCATTCTGGCCA 
*  **  *  *  *  *  *  *  * 


ChE 

AChE 


CCGAAGACGAACTCAATCTCATAGCCGTGGATGACGCCCATCCACTCCGGCCA 


3 

0 

10 


53mer=unique  sequence  probe  used  for  finding  the  ChE  clone 
ChE=human  serum  cholinesterase;  amino  acids  430  to  447 
AChE=acetylcholinesterase  from  Torpedo  calif ornica; 
amino  acids  432-449 
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